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I. INTRODUCTION 



Past several years have witnessed a burst of interest in nonextensive statistical mechanics, 
the topic which finds increasingly more applications particularly due to the concept of Tsallis 
entropy U, 0. In this approach, one postulates the following one-parametric family of 
concave functions, 



1 - Y -V q 

S g = (i) 

where q > 0. The family of Tsallis entropies replaces the traditional Boltzmann-Gibbs- 
Shannon entropy, Si, 



S 1 = \imS q = -y^Pilnpi. (2) 

i 

One of the important achievements associated with Tsallis entropy (H) is the fact that 
it provides an easy access - through the method of entropy maximization - to a rich set of 
distribution functions, different from the traditional Gaussian distribution function, thereby 
giving a handy description of long (non-exponential) tails of probability distributions. For 
each value of the parameter q, Tsallis entropy is a concave function of the probability. 
The characteristic feature of Tsallis entropy is that it is nonextensive for q ^ 1, that is, 
if the system is composed of two statistically independent subsystems, Tsallis entropy of 
this system is not the sum of Tsallis entropies of the subsystems. Since Tsallis entropy is 
postulated rather than derived, this point remains somewhat open to discussions || f|]. 

The goal of this paper is to present an argument on how long tails can be described in 
a usual, extensive (more precisely, almost extensive) statistical mechanics, and to give a 
theoretical derivation of a different, and in a certain sense unique, one-parametric family of 
entropy functions which can model effects of finiteness. 

Our first remark is that real-world systems, to which statistical mechanics is applied, are 
finite, and, although they do consist of a large number of subsystems, the natural logarithm 
of this number (and since we address questions related to entropies, one should observe 
the magnitude of the logarithm, in the first place) is not that big after all, it is not larger 
than 100 and is often less than 20. Extensivity, in a true sense of this notion, theorems of 
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equivalence of the microcanonical and the canonical ensembles 0, and like, is valid only in 
the thermodynamic limit where the system can be partitioned into an arbitrary large number 
of noninteracting and statistically independent subsystems. Namely, it is the number of such 
independent subsystems v which do not interact, and which are similar in all their observable 
properties to the larger system, that plays the role of the parameter the values of which tells 
how close is the system to the thermodynamic limit. 

One is up to invoking that v is finite (and by doing so, one restores to an argument about 
an incomplete extensivity) when one needs to cut off the tails of distribution functions 
with divergent averages. This fact is well known, for example, in the case of the classical 
Boltzmann equation: The maximum entropy solution for the Boltzmann entropy does not 
exist (is not normalizable) if the observables are the density, the average momentum, the 
stress tensor and the heat flux ||, 0, ||, 0, [|K|. A regularization by the argument that 
the magnitude of the microscopic velocity is restricted to the value dictated by finiteness of 
the total energy is an example of the incomplete extensivity argument. 

Thus, when the system is not strictly in the thermodynamic limit, details of the inter- 
action should gradually become more and more important, and prospects of a universal 
description using a maximization of an interaction-independent entropy functional become 
less evident. Nevertheless, the very possibility of a sufficiently reliable universal description 
in the sense just mentioned cannot be ruled out a priori. For that reason, a search for 
non-classical entropies for a possible description of nonextensive systems seems motivated. 

The structure of this paper is as follows: In the next section [Oj, we shall review, for the 
sake of completeness, the theory of Lyapunov functions of master equation. In section |T| 
we derive the family of the (almost) additive entropies from the condition of additivity of 



the entropy function for statistically independent systems. In section [TV] we demonstrate 
with a simple example how long tails are related to the effects of finiteness in the present 
approach. 

Sections [III] and |Iy| are the central point of our presentation. In section [V] we discuss 
a different scenario how the apparent statistical dependence can occur when description of 
the system is incomplete, and present a natural generalization of master equation for those 
cases in section [VT|. Finally, results are briefly discussed in section |V11 . 
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II. LYAPUNOV FUNCTIONS OF MASTER EQUATION 



We shall start our discussion with a brief summary of the theory of Markov chains. Our 
presentation essentially follows Ref. [II] . Let us consider a finite set of states E ly . . . , E N , 



and let us assume that the system can be found only in those states. The probability of 
finding the system at time t > is given by the iV-component vector p of probabilities Pi(t), 
such that pi > 0, and Yli=iPi = 1- Dynamic equation for p of the form, 

N 

Pi = ^2 QikPk, (3) 

k=l 

describes the time evolution of a Markov chain if and only if the matrix elements q^ satisfy 
Qik > for i k, and for every k, 

N 

J>* = 0. (4) 

i=l 

For each Markov chain, a graph of transition is put into correspondence by drawing an 
oriented link from the vertex Ei to the vertex E^ if q^ > 0. Important class of Markov chains 
is characterized by directional connectivity. The graph of transitions is called directionally 
connected if there is a path from each vertex to any other vertex made up of the oriented 
links. Then the following ergodic theorem is valid: Let transition graph of the Markov chain 
be directionally connected. Then there exists a positive stationary state p eq , pf 1 > 0, and 
for any initial condition p(0), solution p(t) to equation (||) tends to p* at t — > oo. 

Let the Markov chain satisfy to the ergodic theorem, and let the stationary state p* is 
known. Let h(x) be a convex and twice differentiable function of one variable x G [0, oo]. 
Any function h defines a convex Lyapunov function of the Markov chain 

N 

H h {p) = Y,P?KP,/P?)- (5) 

i=x 

Time derivative of the function Hh (|5]) due to dynamics @ is nonpositive: 

Hh = E ^PfHVi/p?) - hfa/pf) + tifa/pDifa/p?) - {Pi/P?))} < 0, (6) 

where prime denotes derivative with respect to argument. Equality sign is achieved only in 
the stationary state. 



The stationary state p eq is called the state of detail balance, if it satisfies 

QikPt q = qkiPT- ( 7 ) 

Markov chain with detail balance is colloquially termed master equation. In this case, the 
time derivative of the Lyapunov function becomes especially simple, 

H h = -\ ^PTWi^lpT) ~ h'fa/p?)] [(Pi/p?) - (p./pf)} < 0. (8) 

Discussion of the physical significance of the detail balance for the Markov chain (master 
equation) is a well known textbook material. 

Since a convex linear combination of convex function is also a convex function, the obvious 
construction which enables one to create new Lyapunov functions from given representatives 
of the family (|5|) is this: Let hi, . . . , be convex functions, and ati, . . . , be nonnegative 
numbers, on > 0, satisfying Ylm=i a «i = 1- Then, 

k N 

H aihl ...a k h k = ^2^PT a ™ h m{Pi/p7)i ( 9 ) 
m=l i=l 

is also the Lyapunov function of Markov chain. It should be stressed that the set (^) does 
not extends the family (||) already specified. 

Concluding this summary, we stress that, under physically significant restrictions on the 
existence of the stationary state, any Markov chain has a large class of Lyapunov functions 
of the form (]^), each constructed from a convex function h of one variable. The additivity 
requirement makes it possible to drastically restrict the class of physically relevant Lyapunov 
functions of Markov chains, which we do in the next section. 

III. FAMILY OF ADDITIVE LYAPUNOV FUNCTIONS 

In order to derive the family of additive Lyapunov functions, let us consider two sta- 
tistically independent systems described by probability vectors p and q, Pi > 0, qj > 0, 
£V Pi = 1, YlijPj = 1- respectively. We shall consider first the case of the equipartition at 
the equilibrium, in order to simplify notation. Thus, we assume that equilibrium states of 
both the systems are equipartitions with probability vectors p eq and qr eq , where p° q = 1/P, 
q° q = 1/(5, and where P and Q are the numbers of states in the systems. 
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Since the systems are independent, the joint system is characterized by the joint probabil- 
ity vector pq. The equilibrium of the joint system is again the equipartition, (pq) eq = p eq q eq , 
that is the equilibrium is multiplicative with respect to joining the systems if the latter are 
statistically independent. The condition of additivity for the Lyapunov function (]5p of the 
joint system reads: 

H h (pq) = H h (p) + H h (q). (10) 

This functional equation has two special solutions, corresponding to the convex functions, 
h\(x) = xhix, and h,2(x) = — \nx. We denote Hi = H x \ nx and H 2 = H_\ nx , respectively. 
Whereas the function Hi corresponds to the classical (additive) Boltzmann-Gibbs-Shannon 
entropy, we shall demonstrate here the additivity of H 2 . Indeed, 

PQ 

H 2 (pq) = - J2 P~ 1 Q- 1 MPQp t q 1 ) 

P Q 
= -\n(PQ) -J2^Pi~J2^<l3 

i=l j=l 
P Q 

i=l j=l 

= H 2 (p) + H 2 (q). 

Neglecting the irrelevant constant and constant factors, and using Eq. (|9]), we finally 
arrive at the one-parametric family of additive convex Lyapunov functions (|3p for master 
equation with iV states: 

N ^ N 

H a = (l-^^pilnpi-a—^lnpi, 0<a<l (11) 

i=l i=l 

The one-parametric family of additive Lyapunov functions is the central point of our further 
discussion. Several remarks are in order: 

Remark 1 In the thermodynamic limit, which in the case considered here corresponds for- 
mally to N —>■ oo, for any a, we have H a — ► (l — a)Hi. That is, the non-classical contribution 
due to if 2 becomes significant only if the system is not too close to the thermodynamic limit. 
In the thermodynamic limit survives only the classical Boltzmann-Gibbs-Shannon contribu- 
tion. 
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Remark 2 It is not difficult to prove that the family (|TTD exhausts all the possible ad- 
ditive Lyapunov functions of the form (|^) (up to adding a constant, and a multiplication 
with a constant factor): Indeed, the classical treatment of the additivity condition requires 
averaging the vector function hip which can be done either using p or p eq . The latter is 
the distinguished probability distribution which is, same as p, multiplicative with respect to 
joining the statistically independent subsystems. Relevance of master equation, and hence 
of kinetic rather than of static picture, to our derivation of the one-parametric family ( ITT]) is 
apparent: This enables to consider two sets of probabilities, the "current" p, and the "final" 
p cq (the equipartition here). 

Other convex functions which are additive under joining statistically independent systems 



do exist, for example, the Renyi entropy function [|12| but they are not of the form (|5|) (that 
is not of the so-called "trace form", cf. Ref. [13|]). For this reason, such functions fall out of 
our discussion since the proofs of the inequalities (|6|) and (§) are not valid for them. 

Remark 3 Function H 2 is not defined (and, consequently, any of the function H a , q ^ 
is not defined) if one of the probabilities pi equals to zero. The classical Boltzmann-Gibbs- 
Shannon solution to the additivity equation is distinguished by the property of continuity at 
Pi = 0. This is a blueprint of the long-tail features (see next section). Work with the family 
of entropies (|TTJ) assumes preserving additivity on the expense of abandoning the continuity 
of the entropy functions on closed intervals < pi < 1, and its replacement by continuity 
on semi-open intervals, < Pi < 1. 

Remark 4 To the best of our knowledge, the entropy function 

N 

S 2 = -H 2 = J2^Pi (12) 

i=l 

has been first considered by Burg in the context of applications of information theory to 
geophysical problems [TTJJ , |T5[ . Recently, the Burg entropy ( |T2D has been used to construct 



illustrations of the entropic lattice Boltzmann method in Ref. [0. However, we failed 



to find a reference to the one-parametric family (|Tl]) prior to Ref. [11]. Whereas in Ref. [I I 



the one-parametric family (|TT|) has been mentioned as just the solution to the additivity 
condition, its relevance to describing effects of finiteness in statistical systems has not been 
duly discussed. 

Remark 5 If the equilibrium p cq of the Markov chain differs from the equipartition but is 
multiplicative under joining statistically independent subsystems, the one-parametric family 



(PI) generalizes to the following: 



N ( \ N f \ 

H a = (1- a)^2pi\n(-^) -«^^ q ln(^J . (13) 

IV. LONGER TAILS: AN EXAMPLE 

In this section we want to explicitly work out an example in order to demonstrate that 
the entropies of the family ( |TTD indeed describes the long tails for When the discrete 

system of states is addressed, as it is done here, the meaning of the long tail has to be 
understood as broadening of the distribution functions. 

Without loss of generality, we shall work in this section with the one-parametric set, 

N N 

H a = (1 — a) ^^Pi In pi — a lnpj, < a < 1. (14) 

i=l i=l 

We shall consider first the microcanonic ensemble, that is, the minimizer of H a under the 
constraint of fixed normalization, YliLiP* = 1- ^ * s straightforward to see that, for any 
admissible value of the parameter a, the microcanonic state is the equipartition, as expected. 

In order to address the canonic ensemble, we introduce energies of the states > 0, and 
find the minimum of H a (|T4]) under the constraints, 

N 

Z> = 1, (15) 

i=l 

N 

J2 E iPi = U. (16) 

i=i 



Denoting the solution p( a \ we find, for a ^ 1, 



a 



rfP exp (- (1 _" a)pt ) = ex Pi A " ( 17 ) 

where A and f3 are Lagrange multipliers, corresponding to the constraints flT5|). In order to 
address the effect of a ^ 0, we shall restore to a perturbation theory around the Boltzmann- 
Gibbs-Shannon point pf . After a few algebra, we get for a <C 1, a > 0: 



= Pf ] + a ( 1 - Npf +{U- E,)^^pf ) , (18) 



S 



where 

pf = ^e^, zW = f; e -^ > (19) 

i=l 

is the canonical distribution function for the Boltzmann-Gibbs-Shannon entropy (Lagrange 
multiplier (3^ is expressed in terms of the average energy U by the constraint (|16|); we do 
not need here the explicit expression (3^°'(U)), and 

N 

V = J2 E " ( 2 °) 

i=l 
N 



c = £^ 0) , ( 21 ) 



1=1 



that is,V^ is the total energy of the states, and the denominator appearing in Eq. (|IBD, C — U 



2 



is the correlation of the energy levels E{ in the canonical state (19). We further denote, 



R - V ' NU (??) 

B ~ ~c^Tp- (22) 

It can be argued that B > 0: The total energy of the states, V, is not less (and in most 
of the relevant cases, much larger) than the average energy U times the number of states, 
whereas the correlator C — U 2 is always positive. 

Function (|i~8|) is the first-order perturbation result, and it is not a positive definite quan- 
tity. Yet, it is sufficient to our purpose here, since the question we want to address is as 
follows: What is the sign of the derivatives, dp^ / da\ a _ Q l Probabilities of the canonical 
distribution (|T9|) decay when Ei exceed the average energy U, so, by switching on the Burg 
component, do we see the "raising" of the populations of this "high-energy tail"? In order 
to see this, we obtain in Eq. ([Hp, 



da 

where the factor A- is, 



= A lP f\ (23) 

a=0 



A i = Z ( ®e? WEi -N-B{E i -U). (24) 

Factor amplifies populations of the states which are less populated in the standard canon- 
ical ensemble ( |i~9D if E{ satisfies the inequality, Ei > e, where e is the solution to the equation: 

(l/Z^)e-^° )e (N + B(e - U)) = 1. (25) 
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In order to make the situation even more transparent, we shall assume that the energies 
Ei are in a narrow band around the value E > 0, that is Ei = E + Si, YliLi ^» = 0' anc ^ 
5i <C E. All the quantities contributing to the expression (|1^) can be then evaluated in 
terms of expansion in Si (notice that the second-order perturbation in Si must be used in 
order to compute the correlation C — U 2 ). We obtain, B = (3^ + o(Sf), and, up to second 
order, eq. ( P5| ) reads 

JV 

p(N/2 - 1)5 2 + (3{l - N)8 + (3 2 {N- 1 - 1/2) ^ S 2 = 0. (26) 

i=l 

For large N, this gives that factor (|24]) is larger than zero, and hence amplifies the populations 
of the energy levels E + Si, if 

Si > 2/ $\ (27) 

That is, raising of the populations of higher energy levels is explicitly demonstrated in this 
example. We do not discuss corrections for finite iV to the estimate (P7Q which are easily 
obtained from Eq. (^). 

Thus, we have demonstrated with explicit example that taking into account the Burg 
component in the one-parametric family ( |TT1) indeed is able to describe broadening of the 
canonical distribution function. Appearance of the energy levels correlation in the above 
formula (|18|) remarkably resembles recent results of application of Tsallis entropy to fitting 
experimental data in turbulence (see Ref. |18| and references cited therein). 

Generalizations to quasi-equilibrium situation with more constraints is straightforward. 
Also, if generalizations to a continuous case of states are addressed, the long tail feature 
of the corresponding distribution functions becomes even more apparent. For example, the 
counterpart of the Gaussian distribution function P(x) (maximizer of the Boltzmann-Gibbs- 
Shannon entropy under the constraint fixing the normalization and the variance) has the 
form, P(x) ~ (A + f3x 2 )~ l . For it, algebraic decay at infinity precludes existence of the 
normalization, and a cutoff is required. 

To this end, we have argued that for systems out of the strict thermodynamic limit, there 
exists a universal (that is, not explicitly dependent on details of interactions) one-parametric 
family of additive entropy functions, which are able to describe, at least in principle, the 
same long-tail effects as the Tsallis entropy. In the remainder of this paper we shall discuss 
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a different issue of how additivity of the entropy can apparently be violated if the description 
of the system is incomplete. 



V. NON- ADDITIVITY AND INCOMPLETE DESCRIPTION 

Discussions aimed at justifying a non-additive dependence S(p) are sometimes conducted 
in a rather obscure way: One argues that the entropy is non-additive under joining the sta- 
tistically independent subsystems because, in reality, these subsystems are not independent. 
Possible physical agents which could lead to such a situation are occasionally mentioned, 
like long-range forces, for example, which seemingly justify statements like "the concept 
of independent subsystems does not make any sense, since all subsystems are interacting" 
T8| . However, to the best of our knowledge, a direct demonstration has never been done 



for any realistic system. Moreover, one should be more cautious in rejecting the "concept 
of independent subsystems" , especially when discrete systems as above are addressed, since 
this may lead to a confrontation with the traditional axiomatic (Kolmogorov's) probability 
theory which it is strongly based on this concept JTP| (in the worst case, one should abandon 
the concept of independent trials which is at the very hart of the definition of probabilities). 

This all leads to a question: If the probability distribution over pairs of states pij factors 
into products of distributions, p^ = q^j, but subsystems are dependent, then where this 
dependence is hidden? What is this "new" notion of independence which does not coincide 
with the classical, Prob(A\B) = Prob(A) means A is independent of Bl In order to answer 
this question, one should realize that such a situation of a "hidden" dependence, in fact, 
has been long known in physics. This is the Pauli exclusion principle. The corresponding 
Fermi-Dirac entropy has the well known form: 



S (P) = - m ^ + (! - Pi) m (l - Pi)}- (28) 

i 

This expression can be interpreted in the following way: With the electron gas, there is 
associated a gas of "places" (holes). The state of the ensemble of this gas of holes is uniquely 
determined by the ensemble of the electrons, p^hoie = 1 — Pi- If, for two subsystems of the 
electrons, = q^rj, then, for the corresponding ensembles of holes, we have p^hoie — 1 — <7i r j, 
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and the corresponding product for the subsystems reads, 

(1 - -n) = l-qi- Tj + qiTj ^ Pinhole- 

Therefore, subsystems of the electrons are dependent even for the multiplicative Pij = q^Tj. 

One can term this the effect of hidden components. It should be stressed that we speak, 
in fact, about an incomplete description (both the ensembles are uniquely related to each 
other), namely, that there are hidden components whose entropy has to be taken into ac- 
count. 

A different example, without any reference to quantum effects, is the entropy of mono- 
layers on the surface of a solid (see, e. g. [p0|0 . in the simplest case, the entropy density, up 
to constant factors and constants, has the form, 

S = -c AZ \n(c AZ / c* AZ ) - c z ln(c z / c* z ) , (29) 

where A denotes molecules of the gas, Z is the vacant position on the surface (adsorbing 
center), AZ is the adsorbed molecule, c denotes corresponding surface concentrations, * 
denotes equilibrium concentrations, and since cz + c A z = const (the number of places per 
unit area is conserved) Eq. (|29| ) is again the Fermi-Dirac entropy, obtained without any 
relation to quantum effects. 

Thus, to sum up, the simplest known version of violating the additivity implies existence 
of subsystems of "locations" , "holes" , "ghosts" and like. These subsystems occupy the same 
states as the "observed" system, with probabilities, 

g» = 1 - api, a G [0, 1], or 

q { = (1 - a) + api (30) 

(We have stressed two possible cases, with a positive and with a negative constraint.) There 
might be several such hidden subsystems, and thus 

S = S(p) + Y;a J S j (qM), (31) 

j 

where j is the label of the hidden subsystem, and aj > 0. What the hidden subsystems could 
be? For example, they can describe various nonideal effects like excluded volume in various 
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spaces (not obligatory in the physical R 3 , as in the example of adsorbing centers) . Other in- 
terpretations are probably possible. Here we do not consider any specific examples. Rather, 
we want to emphasize remarkable approximation possibilities provided by expression fl3T|) 
when the entropies S a are used. Indeed, already for just one hidden subsystem we have four 
fitting parameters (two coefficients in front of the Burg component for the system and for 
the ghost, one coefficient a in the constraint (|3T)D, and 1 — a for the ghost). Because approxi- 
mations to the experimental data obtained by the maximum entropy principle under certain 
constraints, whereas the choice of these coefficients is yet another optimization problem, it 
is not difficult, in principle, to organize a procedure of choosing the parameters (learning or 
fitting) in such a way as it is done in neural networks based on the error back propagation 
algorithm. Each time, an intriguing question will be arising, as to how many ghosts are 
needed for a description with a given accuracy, and what is the physical interpretation of 
those. 

VI. HIDDEN SUBSYSTEMS CHANGING KINETICS 

So far, all our considerations of the entropies, including either the Tsallis entropy, or 
the family S a , as well as of the entropy of the ghosts ( |31]) have left a nice option that 
they allow to do nothing about the master equation. Namely, all these entropies can be 
used to describe all kinds of incompletely known or restricted equilibria, or for constructing 
(generalized) canonical ensembles of dynamically conserved or quasi-conserved quantities. If 
the probability evolves in time according to the master equation, all these entropy functions 
behave correctly, that is, they monotonically increase with the time. This fact is well known, 
and it was reviewed above in section [II]. 

In other words, as long as the hidden subsystem is described by the same set of states, 
as the observed one, no restrictions on the Markov kinetic equation arise. However, if 
more freedom is allowed in the choice of the entropy, kinetics has to be modified. Indeed, 
for Markov chain satisfying the detail balance condition (|7|), the natural condition which 
defines the equilibrium of the transition p± ^ pj can be written, 

dpi dpj 
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For the entropy function of the form S = —Hh, where Hh is given by equation (|^), this 
writes, 

h , (pi/p?) = h , (p j /if), (33) 

and, because of strict monotonicity, this results in the definition of the equilibrium, Pi/p eq = 
Pj/Pj 1 - Master equation (^) with the detail balance condition can be rewritten in such a 
way as to make it apparently consistent with the latter result: We denote, 

Wij = QijPf, 
then master equation (H) can be rewritten, 

TV 

Pi = Y^ w a [(Pj/p?) ~ iPi/vT)} • ( 34 ) 

i=i 

However, if the entropy of the hidden subsystem does not have the form Sh = —Hh, with 
Hh given by Eq. ([5]), for example, if it includes terms like, 

N N 
i=l i=l 

then condition ( |3"2| ) results in a more complicated equation, which, unlike Eq. (|3"BD, brings 
in the dependence on all the components of the vector p. In that case, a model kinetic 
equation, more general than the master equations can be addressed. 

There exist a universal way of constructing kinetic equations in a way consistent with the 
given entropy. Let us introduce notation, = —dS/dpi, and let ^>(x) be a monotonically 
increasing function. Then we define the rate of transitions pi — > pj as 

Wijip^ini), (35) 

where = w^, w^j > is a symmetric matrix with nonnegative entries (which might also 
be functions of the probability distribution p). Given the rates fl35"D, kinetic equation takes 
the form: 

j 
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Equation ([36]) is a generalization of the Marcelin - De Donder kinetic formalism (see, for 



instance, pL pO, pl| 



Equation fl3"6|) becomes especially natural to use if the entropy of the system has the form, 

S = S h + S, (37) 

where the part Sh = —H^ has standard form ([5]) for some convex function h while S is the 
entropy of the (part of the) hidden subsystem which does not have such form. Then we 
put, ty(x) = [— that is, ^ is the inverse of the derivative —h'. With this, Eq. fl3"6] ) 
becomes: 

TV 

Pi = ([-^"'(^(Pi/P?) - ^/<%) - [-^T^W^D - . (38) 

i=l 

This is the minimal extension of the master equation: If the hidden system can be described 
with the same entropy (S = 0), equation fl38|) reduces to master equation. However, in 
any case, extensions ([36]) and ( |38"D are consistent with the entropy increase in the kinetic 
processes. 



VII. DISCUSSION 



Once a classical statistical system is out of the thermodynamic limit, the exclusive charac- 
ter of the Boltzmann-Gibbs-Shannon entropy is lost, and classical ensembles are not equiv- 
alent anymore. Whereas using the microcanonical ensemble for any description of finite 
systems may be most appropriate, this route is very complicated from a computational 
standpoint. For that reason, seeking an entropic description of effects of fmiteness seems a 
relevant option. 

In this paper, we have demonstrated that that there exists a unique one-parametric family 
of entropy functions which are consistent with the additivity of the entropy under joining 
statistically independent subsystems. This family is essentially the convex combination 
of the Boltzmann-Gibbs-Shannon entropy and the Burg entropy. This family of entropy 
functions appears in a natural way as a distinguished (by the additivity requirement) subset 
of the family of Lyapunov functions of master equation. It has been demonstrated that the 
nontrivial contribution from the Burg component results in broadening of the high-energy 
tail of the canonical distribution function. The functional form of the deviation, and, in 
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particular, the appearance of the energy correlations indicates that the maximum entropy 
approach successively used recently in the context of Tsallis entropy may lead to similar 
results when the present entropy functions are used. Detailed study of this option is left for 
the future work. 

Finiteness of the classical statistical system is one option which calls for non-classical 
entropies. A different (independent) option is the incompleteness of the description. This 
has been demonstrated by analyzing the classical example of Fermi-Dirac type of entropy, 
and a generalization in the form of "standard entropy for a multi-component mixture plus 
linear constraints" has been suggested. Finally, we have suggested a minimal modification 
of master equation consistent with the given entropy. 
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